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Long-term strain improvements through repeated mutagenesis and screening have generated a 
hyper-producer of cellulases and hemicellulases from Penicillium decumbens 114 which was isolated 30 
years ago. Here, the genome of the hyper-producer P. decumbens JU-AIO-T was sequenced and compared 
with that of the wild-type strain 1 14-2. Further, the transcriptomes and secretomes were compared between 
the strains. Selective hyper-production of cellulases and hemicellulases but not all the secreted proteins was 
observed in the mutant, making it a more specific producer of lignocellulolytic enzymes. Functional analysis 
identified that changes in several transcriptional regulatory elements played crucial roles in the cellulase 
hyper-producing characteristics of the mutant. Additionally, the mutant showed enhanced supply of amino 
acids and decreased synthesis of secondary metabolites compared with the wild-type. The results clearly 
point out that we can target gene regulators and promoters with minimal alterations of the genetic content 
but maximal effects in genetic engineering. 
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The rapid increase in crude oil prices and decrease in crude oil reserves have significantly stimulated the 
search and use of renewable resources and energy. Lignocellulosic materials from agriculture and forest by- 
products are relatively abundant renewable resources. The use of lignocellulosic biomass for the production 
of liquid fuels and chemicals has long been considered to be an important alternative for sustaining the human 
economy and society^'^. In one of the most common biorefinery schemes, lignocellulosic materials are partially 
treated by physical and chemical methods to release cellulose and hemicellulose which then are hydrolyzed to 
fermentable sugars by a lignocellulolytic enzyme system consisting of multiple cellulases and hemicellulases. The 
sugars can be converted to ethanol or other products^ l 

Although recombinant protein expression technology has been applied to the production of most industrial 
enzymes, lignocellulolytic enzyme systems are primarily produced by fungi that are often enhanced through 
strain improvement^'^. The most widely used commercial lignocellulolytic enzymes are produced by Trichoderma 
reesei strains that have been improved through repeated mutagenesis and screening^. Mutations in three genes, 
carbon repressor gene crel (ref. 7), P-glucosidase regulator gene bglR^ and glucosidase Ila subunit gene^/s2a^, 
contribute to the hyper-production of cellulolytic enzymes in T. reesei mutant strains. Mutations in crel and bglR 
lead to reduced carbon catabolite repression, and the mutated gls2(x results in altered N-glycan patterns on 
secreted proteins. Although high numbers of genetic mutations have been found in several T. reesei mutants 
through comparative genomics analysis^°'^\ additional mechanisms accounting for the hyper-producing pheno- 
type remain enigmatic. 
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Some Penicillium species have been reported to produce more 
balanced native lignocellulolytic enzyme systems than T. reesei^^, 
and our previous study indicated that there were more diverse genes 
encoding cellulose binding domain- containing proteins and hemi- 
cellulases in the genome of cellulolytic fungus Penicillium decumbens 
than those in T. reesei^^. The original P. decumbens isolate 114 was 
obtained from decayed straw-covered soil in 1979 (ref 14), and has 
been improved through a series of mutagenesis and screening over 
the years (Fig. lA). Carbon catabolite repression (CCR) -resistant 
strain UVl 1 was first obtained by selecting a mutant forming a clear 
halo zone on a holocellulose-glucose agar plate^^. Multiple rounds of 
further mutagenesis and selection generated JU-AIO (ref. 15), fol- 
lowed by JU-AIO-T, which has been used for industrial- scale cellu- 
lase production in China for 17 years with a productivity of 160 lU 
L"^ h"^ (ref 16). The activities of the lignocellulolytic enzyme system 
of JU-AIO-T increased several times compared to those of the 
sequenced strain 1 14-2. However, the molecular mechanisms behind 
the strain improvement for higher cellulase production are poorly 
understood. 

In this study, comparative and functional genomics studies of 
P. decumbens mutant JU-AIO-T and wild-type strain 114-2 were 
performed to decipher how strain improvement has significantly 
improved the production of the lignocellulolytic enzyme system. 
Mutations in gene regulators and promoter regions partially 
accounted for the increased production of cellulases and hemicellu- 
lases. Transcriptome analysis further revealed clues for the hyper 
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Figure 1 | P. decumbens mutagenesis and screening. (A) Genealogy of 
strains used. The two strains used for genome sequencing are labeled in red. 
Strains shown in gray have been lost over the years. (B) Production of 
lignocellulolytic enzymes by 114-2 and JU-AIO-T. The two strains were 
grown in the cellulose-wheat bran medium for 4 days. FPase stands for 
filter paper enzyme activity. Error bars represent standard deviations of 
triplicate independent cultures. (C) Colony morphology of 1 14-2 and JU- 
AlO-T on cellulose plates. Strain JU-AIO-T generated larger cellulolytic 
halo than strain 114-2 did. 



production of lignocellulolytic enzymes in the mutant. Collectively, 
the results offer a multi-level understanding of enzyme hyper-pro- 
duction by P. decumbens strain JU-AIO-T, and the findings will 
provide guidance for targeted genetic modification to enhance the 
production of lignocellulolytic enzymes. 

Results 

The different phenotypes between wild-type strain 114-2 and 
mutant JU-AIO-T. The mutant JU-AIO-T produces ~9-fold 
higher cellulase activity (measured as filter paper enzyme, FPase), 
~8-fold higher xylanase (a major type of hemicellulase) activity, and 
~4-fold higher total secreted proteins than wild-type strain 1 14-2 in 
cellulose-wheat bran (CW) medium, a typical cellulase producing 
medium for P. decumbens^^ (Fig. IB). Thus, the strain improve- 
ment has resulted in a mutant that produces more secreted pro- 
teins, especially cellulases and hemicellulases. In addition, strain 
114-2 produces dark green conidia, while JU-AIO-T produces 
much fewer and whitish-pink conidia on solid agar plates (Fig. IC). 

Global comparison of genomes of 114-2 and JU-AIO-T. The 

genome of strain JU-AIO-T was sequenced and assembled inde- 
pendently of that of 114-2 (Supplementary Table SI). The genome 
was assembled into 482 contigs totaling 30.69 Mb with 96.3% 
mapped onto the 8 chromosomes and the mitochondrial genome 
of 114-2 (Supplementary Fig. SI). The aligned DNA had an aver- 
age of 1.4 single variations per kilobase (SNVs/kb; Supplementary 
Fig. S2A), less dense than that (7.8 SNVs/kb) between two Asper- 
gillus niger strains Gene curation using transcript omic data yielded 
10,473 gene models in JU-AIO-T (Supplementary Table S2). Strains 
114-2 and JU-AIO-T shared 9,599 proteins with an average amino 
acid identity of 99.55%, including 6,699 proteins with 100% identity 
over the full length (Supplementary Fig. S2B). The average protein 
identity was slightly lower than that (99.96%) between two P. 
digitatum isolates Proteins with sequence differences between 
114-2 and JU-AIO-T were enriched for those with transcription 
factor activities (P- value = 2.77 X 10"^). Manual check of sequ- 
ence alignments suggested that 140 of the 476 predicted transcrip- 
tion factors contained amino acid sequence differences between the 
strains (Supplementary Table S3). Strain- specific genes were mainly 
found in a few genomic regions, encoding primarily proteins of un- 
known function, transposases, vegetative incompatibility proteins, 
and proteins involved in biosynthesis of secondary metabolites 
(Supplementary Table S4). The features of strain-specific genes^° 
indicate that the parent isolate 114 was likely a heterokaryon, and 
current strains 114-2 and JU-AIO-T have been purified by subse- 
quent transferring or screening processes (Fig. lA). 

Comparison of genes encoding lignocellulolytic enzymes. We have 
previously annotated 18 cellulases and 51 hemicellulases in 114-2 
(ref 13). JU-AIO-T carried 66 of these enzymes, and it did not 
contain three adjacent hemicellulase genes in 114-2 scaffold 8 
(PDE_08036, PDE_08037 and PDE_08038). The shared proteins 
were mostly identical. Only 26 amino acid differences were identi- 
fied in 16 cellulases or hemicellulases between the strains, all occur- 
ring at non-conserved positions (Supplementary Table S5) that are 
not expected to affect enzyme activities^\ 

JU-AIO-T increased secretion of lignocellulolytic enzymes and 
decreased secretion of amylases and proteases. Activity assays 
showed that extracellular cellulase and hemicellulase activities were 
both elevated, while the p-glucosidase activity was slightly reduced in 
JU-AIO-T relative to those in 114-2 (Fig. IB). The difference indi- 
cated possible altered secretome composition between the strains. 
When the secretomes of 114-2 and JU-AIO-T were analyzed using 
liquid chromatography- tandem mass spectrometry (LC-MS/MS), 90 
proteins were identified (Supplementary Table S6). Proteins 
involved in the degradation of cellulose, hemicellulose, starch and 
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proteins were detected in the secretomes of both strains. However, 
the proportion of cellulases and hemicellulases increased, while the 
production of amylases, proteases and other proteins decreased 
markedly in the JU-AIO-T secretome, according to peptide spec- 
tral counts of LC-MS/MS^^ (Fig. 2A). The result was confirmed by 
comparing the two-dimensional electrophoresis map of JU-AIO-T 
secretome (Supplementary Fig. S3 A) with that of 114-2 (ref 13). 
Nearly all the protein spots on gel for JU-AIO-T were identified as 
lignocellulolytic enzymes. Consistently, extracellular amylase 
activity and protease activity of JU-AIO-T were significantly lower 
than those of 114-2, respectively (Supplementary Fig. S3B). To 
further assess the changes in secretome composition, transcripts of 
the strains were analyzed (Supplementary Table S7). Transcripts of 
the 34 experimentally detected cellulases and hemicellulases 
accounted for 1.07% of total transcripts in strain 114-2, and the 
percentage increased to 9.03% in JU-AIO-T (Fig. 2B and Tables 
S6). On the flip side, transcription levels of amylase and protease 
genes were lower in JU-AIO-T. The expression of the principal P- 
glucosidase BGLI gene was essentially unchanged in JU-AIO-T. 

Genetic and transcriptional changes of transcription factors 
altered secretome composition in JU-AIO-T. Like the first ob- 
tained mutant UVll, JU-AIO-T showed CCR-resistant expression 
of cellulases and hemicellulases (e.g. higher expression in glucose 
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Figure 2 | Comparison of expression levels of secreted enzymes between 
strains 114-2 and JU-AIO-T. (A) Secretome comparison. Protein 
abundance ratios are calculated according to peptide spectral counts in LC- 
MS/MS. Proteins with lower abundances in JU-AIO-T are shown in gray 
background. The 18 major proteins were at least 1% in at least one of the 
two secretomes. Glycoside hydrolases (GHs) were named according to the 
CAZy families they were classified, except BGLI which belongs to GH 
family 3. Chi 18 A, a chitinase; Mur25A, a muramidase; HPl, a hypothetical 
protein. (B) Transcription level comparison. All the proteins detected in 
LC-MS/MS and 2DE-MS/MS were included. Detailed protein abundance 
ratios and transcription levels are listed in Supplementary Table S6. 



medium, Supplementary Fig. S4A). The gene creA (PDE_03168) in 
P. decumbens, a homolog of T. reesei crel (ref. 7), encoding the key 
carbon catabolite repressor, had a frameshift mutation at the C- 
terminus in JU-AIO-T (Supplementary Fig. S4B). The effect of 
CreA on the expression of extracellular hydrolytic enzymes was 
investigated by deleting the creA gene in strain 114-2. As expected, 
deletion of creA resulted in CCR-resistant expression of cellulases 
(Supplementary Fig. S4C). In addition, the Acre A strain showed 
~ 1.7-fold higher FPase activity and ~3.4-fold higher amylase 
activity than 114-2 in cellulose medium (Fig. 3), confirming the 
negative effect of CreA on cellulase expression under both 
repressing and inducing conditions'^. 

Promoter regions (1.5-kb upstream region) of 11 cellulase genes 
contained 106 nucleotide differences between the strains 
(Supplementary Table S8). When the nucleotide differences were 
analyzed for their effects on putative binding motifs of CreA, one 
SNV in the promoter of a cellobiohydrolase gene cel7A-l was found 
to affect a putative CreA-binding site (two adjacent 5'-SYGGRG-3' 
motifs'^'^^; Supplementary Fig. S5). Further, Cel7A-l had the highest 
fold change among the cellulases up-regulated in JU-AIO-T 
(Supplementary Table S6). To investigate the mutation on transcrip- 
tion efficiency, the EGFP (enhanced green fluorescent protein) gene 
was fused to cel7A-l promoters from 114-2 and JU-AIO-T, respect- 
ively, and then transformed into 114-2. Real-time quantitative RT- 
PCR analysis and microscopy analysis showed that the cel7A-l pro- 
moter from JU-AIO-T had a ~2.6-fold higher transcriptional effi- 
ciency than the promoter from 114-2 (Fig. 4). 

In Aspergillus species, transcriptions of amylase genes are coordi- 
nately activated by the transcription factor AmyR'^. The AmyR 
homologs in P. decumbens strains 114-2 and JU-AIO-T (PDE_ 
03964 and PDT_04193, respectively) were 100% identical, with a 




114-2 Acre A AamyR 

Figure 3 | Effect of deleting creA or amyR in strain 114-2 on the 
production of extracellular enzymes. Mycelia from 2% (w/v) glucose 
medium were grown in 1% (w/v) cellulose medium for another 48 h. 
Enzyme activities (U/ml) and protein concentrations were determined in 
the culture supernatant and expressed as relative values with those of 1 14-2 
set at 1. Error bars represent standard deviations of triplicate independent 
cultures. Statistical significance of the difference between 114-2 and each 
gene deletion mutant is shown. *, P < 0.05; **, P < 0.005. 
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Figure 4 | Transcription efficiencies of cel7A-l promoters from strains 
114-2 and JU-AIO-T. Strains were grown in 1% (w/v) glucose medium or 
1% (w/v) cellulose medium for 48 h. Transcription efficiency was analyzed 
by real-time quantitative RT-PCR of the reporter eGFP gene. 
Transcription levels of eGFP gene were normalized to 100-fold of those of 
(3-actin. Error bars represent standard deviations of triplicate 
measurements. FM, fluorescence microscopy; LM, light microscopy. 
White bar =10 [im. 

much lower gene transcription level in JU-AIO-T than that in 114-2 
(> 15-fold, Supplementary Dataset SI). Deletion of amyR in 114-2 
dramatically decreased amylase activity by 94.2% (Fig. 3). Notably, 
the deletion also resulted in about 1.9-fold higher FPase activity, 
suggesting a negative role of AmyR in the expression of cellulase 
genes in strain 114-2. 

Comparative transcriptome analysis revealed reorganized meta- 
bolism in JU-AIO-T. The gene expression profiling data covered 
—80% of the whole predicted genes (Supplementary Table S7), 
which aflowed the study of genome-wide gene expression changes 
between the strains. Notably, JU-AIO-T had remarkably more genes 
transcribed at low levels (< 10 transcripts per million clean tags) 
than 114-2 did (Fig. 5A). This indicated that some biological 
processes might be less active in JU-AIO-T than those in 114-2. 
Correspondingly, 567 genes were significantly up-regulated, whfle 
up to 1,447 genes were significantly down-regulated in JU-AIO-T 
(Supplementary Dataset SI). 

Compared with 114-2, genes significantly up-regulated in JU- 
AlO-T predominantly coded extracellular polysaccharide hydrolases 
(Fig. 5B and Supplementary Table S9). Genes involved in glycolysis 
and the citric acid cycle had minimal variations in expression, but the 
expression levels of genes for the pentose phosphate pathway (PPP) 
were up-regulated in JU-AIO-T, such as the putative glucose-6-phos- 
phate dehydrogenase (PDE_01924) gene which was up-regulated ~9 
folds (Supplementary Table SIO). The data suggest increased flux 
through PPP, which provides NADPH and precursors for amino 
acids biosynthesis. Interestingly, the genes responsible for channel- 
ing xylose and arabinose, the hydrolytic products of hemiceflulose, 
into PPP for consumption were also over- expressed in JU-IOA-T 
(Supplementary Table SIO). For amino acid biosynthesis, the genes 
of the lysine and cysteine biosynthesis were up -regulated by 2 to 7 
folds (Supplementary Table Sll), which may facflitate the synthesis 
of the four major (hemi-)cenulolytic enzymes rich in lysine and/or 
cysteine (Supplementary Fig. S6). Ribosomal protein genes were sig- 
nificantly enriched in the up-regulated gene set (FDR = 7.31 X 10"^, 
Supplementary Table S9). In addition, five genes involved in protein 



folding were significantly up-regulated in JU-AIO-T (Supplementary 
Table SI 2). Putative protein disulfide-isomerase PDE_06215 and 
molecular chaperone PDE_08980 genes were up-regulated by 6.0 
and 2.7 folds, respectively, which may aid the hyper-production of 
secreted proteins in JU-AIO-T. 

The down-regulated gene set in JU-AIO-T was significantly 
enriched for oxidoreductases involved in metabolism (Fig. 5B and 
Supplementary Table S9). Of the 173 down -regulated oxidoreduc- 
tases, 141 were mapped to KEGG pathways^^, including 35 enzymes 
involved in degradation of amino acids (especially aromatic amino 
acids; Supplementary Table SI 3) and 29 genes responsible for bio- 
synthesis of secondary metabolites. Further examination found that 
104 out of 3 14 genes in predicted secondary metabolism gene clusters 
were significantly down -regulated, whfle only 1 1 were significantly 
up-regulated in JU-AIO-T (Supplementary Table S14). 

Discussion 

Sequence analysis shows that the increased lignoceflulolytic enzyme 
activities in mutant JU-AIO-T are not due to the mutations in cata- 
lytic amino acid residues. This is not a surprise as the same conclu- 
sion has been reached with T. reesei mutant strains that over-produce 
ceflulolytic enzymes Our sequence analysis also points to the 
enriched mutations in transcription regulators. Through functional 
experiments in wfld-type 114-2 as the reference strain, some of the 
transcription factors are shown to be involved in the hyper-pro- 
ducing characteristics of JU-AIO-T. First, global transcription factor 
CreA has a frame-shift mutation at the C-terminus, changing the 
sequence of the last 16 amino acid residues. The crel in T. reesei 
hyper-producing mutant Rut-C30 has a nonsense mutation that also 
enhances cellulase production^. This coincidence suggests that creA 
is a key target in genetic engineering of fungal strains for higher 
production of ceflulolytic enzymes. Considering that glucose- 
resistant expression of cellulases has been observed in strain UVll 
(Fig. lA), the frameshift mutation in creA might occurr in the early 
phase of mutagenesis and screening^^. 

Secondly, down -regulation of AmyR expression contributes to the 
hyper-production of ceflulolytic enzymes in JU-AIO-T. AmyR was 
previously regarded as an amylase-specific regulator and was 
recently shown to also regulate the expression of P-glucosidases 
and galactosidases in A. niger^^. In A. oryzae, mutation in amyR 
causes down-regulation of amylases and up-regulation of ceflulolytic 
enzymes only in the presence of starch, and a CCR-involved mech- 
anism is proposed to account for the increased ceflulolytic enzymes^^. 
A CCR-involved mechanism is logic as AmyR activates the synthesis 
of amylases that convert starch to glucose, and the latter inhibits the 
production of ceflulolytic enzymes through CCR. However, AmyR 
may also directly repress the transcription of ceflulolytic enzyme 
genes, as shown in our medium without starch (Fig. 3). To the best 
of our knowledge, this is the first report of a regulatory effect of 
AmyR on cellulose-induced ceflulase expression in fungi. 

Comparative analysis of N. crassa^^ and T. reesei^^ strains with 
different production levels of ceflulolytic enzymes have shown the 
up-regulation of protein synthesis and processing-related proteins in 
fungal strains that over-produce ceflulolytic enzymes. Our global 
gene expression analysis show simflar changes between JU-AIO-T 
and 114-2. In addition, we have identified elevated transcription of 
genes involved in lysine and cysteine biosynthesis required for the 
production of major lignoceflulolytic enzymes in JU-AIO-T. In sum- 
mary, mutations in the key gene expression regulators and promoter 
regions, together with other functional mutations to be identified, 
made strain JU-AIO-T a hyper-yield and "specific" producer of lig- 
noceflulolytic enzymes compared with the versatfle wfld-type strain 
114-2. Activities facflitating the production of lignoceflulolytic 
enzymes are enhanced and other activities, such as the synthesis of 
secondary metabolites, are decreased. The results clearly dem- 
onstrate that we may target gene regulators and promoters with 



SCIENTIFIC REPORT: | 3 : 1569 | DOI: 1 0.1 038/srepO 1569 



4 



A 



Distribution of gene transcription 



TPM ranges 




1114-2 
iJU-AIO-T 



'^eQ^^ Qs^o^^ o^^o^^ 



0 0-10 
Low expression 



10-50 50-100 100-500 
Moderate expression 



500-1000 1000-2000 >2000 
High expression 



B 



Function enricliment of 
differentially expressed genes 





GO-ID 


Description 


FDR 




GO:0005576 


extracellular region 


8.12E-22 




00:0004553 


hydrolase activity, hydrolyzing 0-glycosyl 
compounds 


1.42E-18 




00:0005975 


carbohydrate metabolic process 


2.13E-18 


Up- 
Regulated 


00:0030248 


cellulose binding 


2.48E-18 


00:0016798 


hydrolase activity, acting on glycosyl bonds 


1.14E-17 


00:0001871 


pattern binding 


2.84E-15 




00:0030247 


polysaccharide binding 


2.84E-15 




00:0030246 


carbohydrate binding 


4.26E-15 




00:0000272 


polysaccharide catabolic process 


4.68E-14 




00:0016052 


carbohydrate catabolic process 


4.45E-13 




00:0005506 


iron ion binding 


2.01 E-05 


Down- 
regulated 


00:0020037 
00:0046906 


heme binding 
tetrapyrrole binding 


2.01 E-05 
2.01 E-05 


00:0055114 


oxidation reduction 


3.19E-03 




00:0016491 


oxidoreductase activity 


5.86E-03 



Figure 5 | Genome-wide gene expression changes between strains 114-2 and JU-AIO-T. (A) Distribution of gene transcription levels. TPM, transcripts 
per million clean tags. Note that the percent of genes with very high transcription levels (TPM > 2,000) in JU-AIO-T is higher than that in 1 14-2. (B) Gene 
Ontology (GO) enrichment of genes of significantly differential expression between JU-AIO-T and 114-2. For up-regulated genes, only the top 10 
significantly enriched GO terms are shown. The full results of gene function enrichment including gene IDs are listed in Supplementary Table S9. 



minimal alterations of the genetic content but maximal effects in 
genetic engineering. 

Methods 

Strains and culture conditions. P. decumbens strains 114-2 (CGMCC 5302) and JU- 
AlO-T were used in this study. All culture media were based on modified Mandels' 
salt solution (MMS), containing (g L'^): KH2PO4 3.0, NaNOg 2.6, MgS04-7H20 0.5, 
CaCl2 0.5, FeS04-7H20 0.0075, MnS04-H20 0.0025, ZnS04-7H20 0.0036 and 
CoCl2*6H20 0.0037. Conidial suspensions were inoculated at a final concentration of 
10*^ per ml, and strains were grown at 30°C, 200 rpm (for liquid culture). For 
secretome analysis, digital gene expression profiling, and comparison of extracellular 
enzyme activities between 114-2 and JU-AIO-T, strains were grown in MMS 
supplemented with 0.1% (w/v) peptone and 1% (w/v) microcrystalline cellulose plus 



1% (w/v) wheat bran (CW medium) for indicated time periods. MMS supplemented 
with 0.1% (w/v) peptone and 2% (w/v) glucose (G medium) was also used in 
comparison of extracellular enzyme activities. For comparison of extracellular 
enzyme activities between 1 14-2 and transcription factor gene deletion mutants, 
conidia were pre-cultured in G medium for 24 h. Then, mycelia were collected by 
vacuum filtration, washed with sterile water and re- suspended in MMS supplemented 
with 1% (w/v) microcrystalline cellulose for further growth. 

Protein concentration and enzyme assays. Protein concentration and extracellular 
hydrolase activities were assayed as previously described^^. For FPase assay, dilutions 
of supernatant (0.5 ml) were incubated with Whatman No.l filter paper (1X6 cm, 
50 mg) in 1.0 ml of 0.2 M acetate buffer (pH 4.8) at 50°C for 60 min. Statistical 
significance tests were carried out using Microsoft Excel program with a one-tailed 
equal variance t-test. 
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Genome sequencing, assembly and annotation. The genome of JU-AIO-T was 
sequenced with 454 GS FLX Titanium platform and the Illumina Genome Analyzer. 
Genome assembly was performed with the standard 454 assembler Newbler^^ version 
2.3 to generate contigs. Illumina mate-paired reads of strain JU-AIO-T were mapped 
to contigs from 454 assembler to generate scaffolds. Gene models were predicted 
independently with a set of gene finders as performed in strain 114-2 (ref 13), and 
then manually curated with 454-sequenced transcriptome sequences (NCBI 
Sequence Read Archive database accession no. SRA048522) and nr BLAST^^ hits to 
justify intron-exon boundaries and UTR regions. Blast2GO^^ was used for Gene 
Ontology (GO)^^ and InterPro^^ annotation. Transcription factors were annotated 
according to the InterPro IDs in Fungal Transcription Factor Database^*^. Secondary 
metabolism gene clusters were predicted with SMURF^^. Protein sequences from the 
eukaryotic genomes in KEGG GENES database^^ (Version 57) were compiled as 
BLAST library, and KO (KEGG Ortholog) identifiers were assigned with KOBAS'' 
algorithm with a combination of thresholds (e-value < le-5 and rank < 5). 

Genetic variation analysis. The genomes of 1 14-2 and JU-AIO-T were aligned using 
mauve^^ '*" and Mummer'*\ In addition, sequencing reads of one strain were mapped 
to the genome of the other strain using BWA^^-^^, Bowtie^^, SHRIMP*^, gsMapper^^ 
and MOSAIK- ALIGNER (Version 1.1. 0021, http://code.google.com/p/mosaik- 
aligner/). SNVs were called by Samtools and VarScan**^ via the resulted SAM or BAM 
files from the aligners. The SNVs were then filtered by composition complexity, 
where those with more than five homo-polymers in 21 bp-windows were discarded. 
SNVs supposed by both reads level and genome level were retrieved as the final SNV 
set. Strain- specific genes were manually checked to remove false positive results 
caused by sequencing gaps or mistaken gene predictions. 

Secretome analysis. Culture broths of strains grown in CW medium for 48 h were 
centrifuged at 12,000 X ^ at 4° C for 20 min and the supernatants were analyzed by 
reversed-phase (RP) LC-MS/MS and two-dimensional electrophoresis (2DE)-MS/ 
MS techniques. RP-LC-MS/MS and data analysis were performed as previously 
described^^. The spectral counting method was used to determine the abundance of 
proteins in the secretomes*^. The secretome of JU-AIO-T was also analyzed by 2DE 
using Ettan DIGE System (GE Healthcare) in dark as done in 1 14-2 (ref. 13). Spots of 
interest were manually excised, washed and trypsin-digested for MALDI-TOF/TOF 
mass spectrometry on 4700 Proteomics Analyzer (Applied Biosystems) in the positive 
ion reflector mode. An in-house P. decumbens protein sequence database was used for 
protein identification. 

Digital gene expression profiling. Strains grown in CW medium for 41 h were used 
for total RNA extraction using the RNAisoTM reagent (TaKaRa, Japan) according to 
the manufacturer's protocol. Digital gene expression profiling experiment based on 
Solexa sequencing'^*^ was performed by Beijing Genomics Institute (BGI) at Shenzhen, 
China. All clean tags obtained after raw data processing were mapped to the reference 
tag database (predicted transcripts plus downstream 300-nucleotide sequences) of the 
corresponding strain. Only sense expression was considered and no mismatch was 
allowed. The copy number of unambiguous tags (tags mapped to one single gene) for 
each gene was normalized to TPM (number of transcripts per million clean tags). 
Genes of significantly differential expressions were identified by a significance test'^'' 
with combined thresholds (FDR^° < 0.001 and fold change > 2). Balst2GO^^ was used 
for function enrichment analysis of gene sets with the threshold of FDR < 0.05. 

Gene targeting of creA and amyR. The gene creA was deleted using AcreA::hph 
targeting cassette carrying hygromycin B phosphotransferase (hph) expression 
cassette flanked by 5' and 3' sequence of creA gene^\ Retransformation of AcreA 
strain was performed using creA::ptrA targeting cassette carrying pyrithiamine 
resistance gene {ptrAY^ expression cassette flanked by creA expression cassette and 
downstream sequence of creA gene. The gene amyR was deleted in 114-2 using 
AamyR-.-ptrA targeting cassette carryingpfrA expression cassette flanked by 5' and 3' 
sequence of amyR gene. Construction of targeting cassettes and transformation of 
P. decumbens were performed as previously described^\ and successful targeting was 
confirmed by Southern blot. 

Promoter efficiency analysis. The reporter gene eGFP was introduced into the vector 
pDHt/SK under control of cel7A-l promoters from 114-2 and JU-AIO-T, 
respectively. The expression vectors were transformed into the recipient 1 14-2 via 
Agrobacterium- mediated fungal transformation. Transcription levels of eGFP and P- 
actin genes were analyzed by real-time quantitative PGR (Eppendorf, Germany), in 
which the primer pair 5'-AGTGCTTCAGCCGCTACCC-3' and 5'-GATGCCG- 
TTCTTCTGCTTGTC-3' were used for amplification of eGFP gene, and 5'-TGAG- 
GCTCCCATCAACCC-3' and 5'-CACCAGAGTCCAGCACGATA-3' were used 
for amplification of P-actin gene (PDE_01092). 

Accession numbers. The Whole Genome Shotgun project of strain JU-AIO-T has 
been deposited in DDBJ/EMBL/GenBank under the accession number AGIIOOOOOOOO. 
The digital gene expression profiling data have been deposited in NCBI's Gene 
Expression Omnibus under the GEO Series accession number GSE34288. 
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